Towards Neurocomputational Speech and Sound Processing
Authors: J. Rouat, S. Loiselle and R. Pichevar
Abstract
From physiology we learn that the auditory system extracts several features simultaneously from the underlying signal, giving rise to simultaneous representations of audible signals. We also learn that pattern analysis and recognition are not separate processes (in contrast to the engineering approach to pattern recognition, where analysis and recognition are usually separate processes). Furthermore, in the visual system, it has been observed that the order in which neurons fire is crucial for fast visual recognition tasks (Rank Order Coding). The use of Rank Order Coding has also recently been hypothesized in the mammalian auditory system. In a first application we compare a very simplistic speech recognition prototype that uses Rank Order Coding with a conventional Hidden Markov Model speech recognizer. It is also shown that the type of neurons used should be adapted to the type of phonemes (consonants/transients or vowels/stable) to be recognized. In a second application, we combine a simultaneous auditory image representation with a network of oscillatory spiking neurons to segregate and bind auditory objects for acoustical source separation. It is shown that the spiking neural network performs unsupervised auditory image segmentation (to find 'auditory' objects) and binding of the objects belonging to the same auditory source (yielding automatic sound source separation).
Keywords: Auditory modelling, Source separation, Amplitude Modulation, Auditory Scene Analysis, Spiking Neurons, Temporal Correlation, Cochlear Nucleus, Corrupted Speech Processing, Rank Order Coding, Speech recognition.
This work has been funded by NSERC and Université de Sherbrooke. S. Loiselle has been funded by FQRNT of Québec for the year 2006.
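As an illustration of the Rank Order Coding idea mentioned in the abstract, here is a minimal Python sketch. It is not the authors' recognizer: it follows the commonly described formulation in which only the order of first spikes matters and each successive spike contributes less to a target neuron. The function names, the `modulation` factor of 0.9, and the three-channel toy input are assumptions made for this example.

```python
def rank_order_encode(intensities):
    """Return channel indices ordered from strongest to weakest input.

    Assumption: stronger inputs fire earlier, so this list is the firing order.
    """
    return sorted(range(len(intensities)), key=lambda i: -intensities[i])


def make_template(intensities, modulation=0.9):
    """Build weights tuned to one firing order (hypothetical training rule).

    The channel that fired at rank r receives weight modulation**r.
    """
    order = rank_order_encode(intensities)
    weights = [0.0] * len(order)
    for rank, channel in enumerate(order):
        weights[channel] = modulation ** rank
    return weights


def rank_order_activation(firing_order, weights, modulation=0.9):
    """Sum the weights, attenuating each by modulation**rank of its spike."""
    return sum(modulation ** rank * weights[channel]
               for rank, channel in enumerate(firing_order))


# Toy example with three channels (values are illustrative only).
learned = [0.1, 0.9, 0.5]
weights = make_template(learned)
same = rank_order_activation(rank_order_encode(learned), weights)
other = rank_order_activation(rank_order_encode([0.9, 0.1, 0.5]), weights)
print(same > other)  # the learned firing order gives the larger activation
```

Because only the order is kept, scaling every input by the same positive factor leaves the code unchanged, which is one reason this kind of coding is described as fast and robust to overall level.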
Similar resources
Towards a neurocomputational model of speech production and perception
The limitation in performance of current speech synthesis and speech recognition systems may result from the fact that these systems are not designed with respect to the human neural processes of speech production and perception. A neurocomputational model of speech production and perception is introduced which is organized with respect to human neural processes of speech production and percept...
Dynamic and task-dependent encoding of speech and voice by phase reorganization of cortical oscillations.
Speech and vocal sounds are at the core of human communication. Cortical processing of these sounds critically depends on behavioral demands. However, the neurocomputational mechanisms enabling this adaptive processing remain elusive. Here we examine the task-dependent reorganization of electroencephalographic responses to natural speech sounds (vowels /a/, /i/, /u/) spoken by three speakers (t...
The influence of (central) auditory processing disorder in speech sound disorders.
INTRODUCTION Considering the importance of auditory information for the acquisition and organization of phonological rules, the assessment of (central) auditory processing contributes to both the diagnosis and targeting of speech therapy in children with speech sound disorders. OBJECTIVE To study phonological measures and (central) auditory processing of children with speech sound disorder. ...
Constructing Cerebellum Model by Researching on its Contributions to DIVA
DIVA (Directions into Velocities of Articulators) is a mathematical model of the processes behind speech acquisition and production, supposed to achieve a functional representation of areas in the brain that are involved in speech production and speech perception. Introducing cerebellum control mechanism into the model plays a significant role in improving the mechanism of speech acquisition an...
Effects of sound pillow in the treatment of stuttering and cognitive phonemes impairment in children
Introduction: Verbal language is a fundamental component of expressing ideas, social interaction and understanding educational materials. Effective communication requires verbal language skills. Sound pillows may partly help children with behavior problems. The purpose of this study was assessing the effect of educational sound pillow in the treatment of stuttering and cognitive phonemes i...
Publication date: 2005